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Final Report 


Automatic Photointerpretation for 
Land Use Management in Minnesota 


Abstract 


Automatic Photointerpretation techniques were utilized 
to evaluate the feasibility of data for land use management. 

It is shown that ERTS MSS data can produce thematic maps of 
adequate resolution and accuracy to update Land Use Maps. In 
particular, five typical Land Use areas were mapped with class 
ification accuracies ranging from 77% to over 90%. 
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I. Intr oduction 

Our major objective in this study has been to apply auto- 
matic photointerpretation techniques to land use thematic map- 
ping from ERTS MSS data. The land use classes selected were 
guided by those used in the 1969 Minnesota State Land Use Map, 
i.e, forested, cultivated, open, water, marsh, urban residential, 
urban non-resident ial, extraction and transportation. The 
1969 map was prepared from aerial photographs and manual inter- 
pretation. 

The study approach was a set of experiments using ERTS 
MSS data. Five geographical areas in Minnesota were chosen for 
analysis. A multiclass (K-Class) automatic classification pro- 
cedure was utilized to prepare thematic maps. The fifth experi- 
ment consisting of a lake count was performed by setting an 
upper threshold on band 7 for detecting water bodies. 

In one of the experiments, the automatic classifier was 
trained on a training area and tested on another area approxi- 
mately 25 miles from the training site. This experiment was 
performed to determine the generality of training weights. 

There was no noticeable degradation in performance over this 
distance . 

The addition of texture features was evaluated on the 
metropolitan area. Texture improves classification performance 
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but its tneasurement requires larger areas of the same class. 

Thus the ground resolution is degraded. 

The quality of the input data was generally good. In 
several of the images, one of the six channels either put out 
an attenuated signal or experienced random drop-outs. To 
accommodate this error source, these channels were delt with 
separately, i.e. separate classification weights were deter- 
mined for any channel whose output was noticeably different 
from the norm. 

I I . E xperimental Design 

The data used for constructing the feature vector was obtained 
from 7-track 800 BPI computer compatible tapes. The bulk data 
on black and white 9.5 inch positive transparencies was used 
to locate the area of interest. The ERTS 9^5 inch bulk photos, 
the RB 57 aerial photos, county maps and Mark Hurd aerial sur- 
vey photos were used to select the areas to be analyzed. These 
data blocks were then extracted from the four computer compat- 
ible tapes (CCT) . For most of the small data blocks, only one 
digital tape was required. When two tapes were needed to 
cover the desired area, alignment of data between the two tapes 
was achieved by aligning data having the same record number. 
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The extracted data block from the CCT was reformatted for 
writing out on an Optronics film writer or on a line printer 
which printed out a grey map. For writing out the image, the 
dynamic range of the film writer was matched to the density 
values of the image as determined from a histogram plot. The 
Optronics films were written out in 64 equal density level 
increments. The line printer grey maps were composed of over- 
writing four alpha numeric characters to achieve 16 density 
levels. These levels were also assigned on the basis of the in- 
tensity histograms. An attempt was made to achieve different 
density symbols for the various classes. 

The selection of classes was guided by the Minnesota Land 
Management Information System (MLMIS) classes. Of the nine 
classes used in this system, seven were included in this study. 
Cultivated regions were not considered because of the unavail- 
ability of current ground truth. In addition, the transporta- 
tion class was not broken out as a separate class because of the 
lack of training samples. 

Data Analysis Procedure 

After receipt of the ERTS data on photographs and on digital 
tapes, the photographic images were checked for gross anomolies 
in the detector operation. When banding was apparent, histo- 
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grams were computed for each of the six detector scan lines to 
detect differences in the mean and variance in the output 
each detector. When large differences occurred in the output 
of the detector channels, they were processed separately. 

The ERTS data was registered with available ground truth 
(obtained from aircraft photography and sample ground inspec- 
tion) by recognition of landmarks. For example, lakes on MSS 
band seven provided good landmarks for registering the two 
images, hn enlarged view of the ERTS image landmarks were 
obtained by converting the ERTS digital tape to film with 


Optronics Digital tape-to-film writer. The film record output 
is an image containing grid lines corresponding to record and 
word numbers on the COT. In addition, the landmarks are dis- 
played in context on the image, thus simplifying photointer- 
pretation and registration. This procedure allows registering 
the tape to a known landmark to within a single resolution 
element. Registration of the ERTS data with ground-truth maps 
is essential for both training and evaluating the automatic 
interpretation system and for producing the stratification 
overlay. Registration is also essential for adding successive 

coverage data* 
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Once ground truth and ERTS data are registered, class 
boundaries can be encoded in temns of record and word numbers 
on the digital tape. From within these class boundaries, 
the training samples were extracted. The generation of the 
thematic maps was achieved by using a trained classifier. 
Training is performed by a "teacher", i.e. a training set of 
data is isolated for each of the classes of interest. The 
training set is used to generate decision weights as follows. 

Samples of each class from the training set are grouped 
together in a least-mean-square sense about points in decision 
space which are equidistant from each other and from the origin. 
Linear boundaries are determined to minimize the. error assign- 
ments for this training set. These boundaries or decision 
weights are then used to categorize the input data during the 
formation of the thematic map. While generating these decision 
boundaries, a confusion matrix is printed. This provides per- 
formance data for the operator to use in evaluating the expected 
accuracy of classification. 

By using the spectral distribution of energy over the 
classes of interest, it i^ evident that some of the classes 
are separable. For example, the histograms in Figure 1 indicate 
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that water and conifers are easy to identify on this October 
image taken over Cloquet# Minnesota. Because of the leaf- 
off condition, hardwoods are not as easy to separate from open 
areas and cities. Adding a January coverage greatly improves 
separability. 
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Figure 1, Histograms of Intensity Levels 
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A block diagram depicting the chain of events in the auto- 
matic photointerpretation procedure is shown in Figure 2. 

The following discussion is centered around Figure 2, The 
data block from the 100 by 100 mile ERTS image will be selected 
by longitude and latitude specifications. Finer resolution 
can be obtained by using aerial photos and landmarks for de- 
picting the desired area. Because of the earth’s rotation, the 
ERTS photos are not aligned to the north despite the polar orbit. 
This requires a slightly larger rectangle of data to include 
all of the area described by corner points expressed in terms 
of longitude and latitude. 

The ERTS bulk tapes are then re formated to be compatible 
with a film writer. Data from each of the bands is written 
out using 6 bits per byte. The data is also re formated so that 
intensity levels from the four bands for each pixel are adjacent 
to one another for the automatic classifier feature vector. 

One channel of the selected data block is then written out on 
a film writer and registered with ground truth. If dual cover- 
ages are used, registration is made with a second coverage. 
Registration between dual coverages (October and January) over 
Northern Minnesota required translation in longitude and lati- 
tude and a rotation of 0.03. degrees of one coverage with respect 


to the other . 
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Figure 2. Automatic Photointerpretatioui Procedure 















- 10 - 


The classes selected should be guided by separability, that 

is, by some insight into the natural clustering of the data. 

» 

Once the classes have been chosen, training samples are selected 
from representative areas to determine classifier weights. 

The K-Class training procedure generates a confusion matrix, 
which permits judging the performance. The confusion matrix 
lists the actual ground-truth-versus-classifier assignments 
for all samples, thus one can determine where the most errors 
are made. A cost procedure can also be used for penalizing 
certain misclassif ications , thus reducing this type of error. 

After obtaining an acceptable classification performance 
on the training set, these classifier weights are used to 
categorize the selected blocks of data. Ancillary information, 
such as aspect or geographical location, could be fed into the 
categorizer at the same time to improve classification accuracy. 
The output of the categorizer is a digital magnetic tape with 
each pixel coded into one of the prescribed classes. 

The color coded thematic map is generated from the thematic 
map on the digital data tape by writing out the coded pixels 
into three black and white transparencies . The 

transparencies can be formed such that any class can be assigned 
to any desired color. The color spectrum can be divided into 
any required number of colors, however as the required number 
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of colors increases, the change in hue between classes decreases 
The three primary color black and white transparencies are then 
color combined into the final color-coded thematic map. 

III. Experimental Results 
Ramsey County Lake Study 

This study was run using Ramsey County as the test area, 
due to the availability of an ERTS coverage and easily access- 
ible ground truth data. The ERTS photo 1075-16321, an October 6 

1972 coverage was used. Lake acreages were determined by set- 

ting a threshold on band 7 and counting areas below this thres- 
hold level. Open water generally provides a signal of 5 or 

less, marsh areas can be detected in the range of 6-9. 

An inventory of water resources is valuable in determining 
water resources, recreational areas, and storage capabilities. 
Presently, this is done from aerial photos which are costly, 
time consuming, and, therefore, not taken often. Potentially, 
ERTS photos offer an 18^day inventory updating system. This 
information could also be used to improve the water information 
on county highway maps since it is not unusual for existence, 
size, location, and/or shape of a lake on these maps to be err on 


eous . 
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More rapid changes can be observed by repetitive coverage 
such as that provided by ERTS. These involve changes in shore- 
line and lake size due to natural processes and development, or 
the side effects of development. The resulting changes can 
effect waterfowl, fish and also legal matters; e^g., the bring- 
ing' of unused shallow lake lands into productive use involves 


a taxation change on that land. 

‘ The monitoring of changes in lake levels and lake occur 

rences is also of great value in developing an index of annual 
waterfowl production. Canadian and U. S. biologists suggested 
that a reliable production index could be derived independent 
of size of the breeding population from estimates of the number 
of ponds remaining in mid-July.* Automatic data processing 


of changes in wetness during the May- July period would 
provide this. 

For this study, a print out of tha band 7 was used. Since 
each data point is approximately equal to 1.5 acres on the 
ground, one had only to count up the groupings of numbers and 
multiply the obtained total by a correction factor. To account 
for element overlap along the scan lines, the expression 
1.104N + .453L was used. In the expression, N is the number 
of elements and L is the number of scan lines involved in the 


composition of any irregular shaped object. 

*»Prelimin ~^y Evaluation of ERTS-1 for Determining Numbers 
and Distribution of Prairie Ponds and Lakes"; Edgar A.Wor , Jr., 
Environmental Research Institute of Michigan; David S. Gilmer 
and A, T. Klett, Bureau of Sport Fisheries and Wildlife Northern 

prairie Wildlife Research Center • 
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The derivation of this relationship is shown below: 


3 ^ 7 



b = 56m 


Area of one strip = a [a + (n-1) b] 

I* 

Area of L strips = S a[ (a-b) + n, b] 

A=1 ^ 

L 

= L a (a~b) + ab E n. 

-t=l 

= L a (a-b) + ab N = (1817L + 4424N) meter^ 
A = 0.453L + 1,104N acres 

After comparing the shape of several lakes on the computer 
print out with their respective shape on topographic maps, the 
interval of 0 through 5 on band 7 was selected as being indi- 
cative of water. The acreage of each lake in Ramsey County was 

then counted and compared with available ‘data on basin and lake 
acreage . 

It Should be pointed out that the interval selection is 
somewhat arbitrary. Any error in the choice of an interval 
should produce consistant errors in acreage, i.e., consistantly 
too high or consistantly too low. 
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The acreages determined from the ERTS data were compared 
with available data. Any lakes in Ramsey County not included 
in this study did not fall within the area pulled from the 
ERTS photo. 

A glance at the counted acreage as compared to basin 
acreage and lake acreage reveals that, in most cases, the 
counted acreage provides a good measure of the actual acreage. 

The comments and lake types included are an attempt to account 
for the discrepancies that arise. 

In comparing the figures, one should bear in mind that 
basin acreage indicates the area of the basin, i.e., the depres- 
sion, not the actual area that contains water. Likewise, the 
lake acreage includes most marsh areas around the lake, not 
just areas of open water. The chosen interval of 0 through 5, 
on the other hand, seems to indicate only open water, or water 
with a very slight amount of emergent vegetation or rushes. 

This can be seen by looking at some of the lakes indicated as 
eliminated by air photo' in the comment column. 

The phase "eliminated by air photo" indicates that a parti- 
cular lake was considered to contain no open water, i.e., to 
be drained or swamp, from analysis of 1968 low altitude air 
photos of Ramsey County. As can be noted from the table, several 
of the eliminated lakes are shown as containing small amounts 
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of water. These are likely lakes that contain water in only 
part of the year and are swamp or dry for the rest. Some of 
the lakes, for example 62-20, in, the upper right quadrant of 
Figure 3, can be made to assume their exact basin shape by 
considering the numbers 0 through 9. This seems to imply that 
the lake is surrounded by detectable swamp (this lake is shown 
as surrounded by swamp or marsh on the USGS topographic sheet) . 

Other lakes, for example 62-23, indicated as swamp from 
field checking, are not visable at all until the interval 6 
through 9 is noted, in addition, an area just north of Pigs 
Eye, 62-4, shows up in the interval 6 through 9 and is design- 
ated swamp on the topographic sheet. It seems, then, that 
this interval, 6 through 9, could be used to identify swamps. 

This technique is not useful for determining depth in 
lakes. While the data shows that many of the counted and 
planimetric calculations are close, this similarity is coin- 
cidental. A comparison of the areal distribution of lake 
acreage having a depth of 10 feet or greater, as determined 
from the Division of Water, Soil, and ilinerals map and this same 
acreage determined from the computer print out, shows no 
correlation. This lack of correlation is likely due to such 
factors as sun angle and lake water pollution. 



Lake NO. 
62-1 

62-4 

62-5 

62-6 

62-7 

62-8 


62-12 

62-13 


Name 

Silver 

Pigs Eye 

Caseys 

Kohlman 

Gervais 

Savage 


Round 

Phalen 


counted 

Acreage 

60.68 

547.91 

8.02 

72.07 

205.07 

17.32 

t ■ 

I 

10.24 

70.93 

15.58 

18.01 

191.09 


Basin ^ 
Acreage 

68 

511 

14 

84 

234 

46 

14 

72 

23 

23 

193 


62-9 Round 

62-10 Keller 

62-11 Wakefield 


Lake 

Acreage ' 
73 


206 


192 


62-14 


no name 


6.91 


18 


5 

Counted 

Acres Acres 

Depth s Depth ^ 6 

10 feet 10 feet' Type 

V 

IV 

VA 

VA 

135 138.75 V 

IV 

VA 

VA 

VA 

V 

124 127.65 V 

VA 


7 

Comments 

seems to match exactly 
the 5 ' contour 

much swamp to east & 
south 

basin slightly large r, 
mainly open water 

basin (depression) 
larger 

basin larger than lake 
sunf 

13 5E now cuts through 
lake; basin much larger 
than water area 
reeds & emergent veg. 

basin larger than 
actual lake 

basin larger 

1/5 shoreline is marsh; 
basin slightly larger 

marsh in area to north 
that appears to be basin 

basin slightly larger 

basin larger than water 
area; marshes to NW & 

SE 


Lake No. 


Name 


Counted 

Acif^eage 


Basin 

Acreage 


Lake 

Acreage 


62-15 Sandy not 22 

vi sable 

62-16 Beaver 68.91 65 

62-17 no name 17.35 24 

62-19 Black 9.78 120 

I 

62-20 no name 3,78 22 

62-21 Tamraarack 1,56 69 

62-22 no name 1.56 10 

62-23 no name not visable • 14 

62-24 Birch 107.56 127 

t ■ 

62-25 Ox 7.36 13 

62-26 no name 5.80 12 

62-27 Gillfillan 89.80 87 

'62-28 \ Sucker 56.70 59 60 


62-29 


Basswood 


not vis 


110 



|Ac^^^ed. 

Acres r Acres 

Depth 2 : Depth ^ j 

10 feet 10 feet ( Type 


Comments 


VA sludge basin, St. Paul 
water supply 

VA marsh to NE; small 
bit to south 

IV marshy & swampy 

IV surrounded by marsh; 

eliminated by air photo 

rv eliminated by air photo 

III eliminated by air photo 

VA marsh; inaccessible 

VA swamp 

VA 96-county road G-now 

•goes through south end 

VA pond, swamp, some open 
water; 96-county rd. G 
goes through 

IV southwestern & northwest- 
ern shores-marsh, narrow 
strip connecting top & 
bottom 


25.53 V St, Paul water supply 

III eliminated by air photo 



Counted Basin Lake 

Lake No. Name - Acreage Acreage Acreage 

62-30 Lambert not vis. 542 

62-31 Grass not vis. 84 

62-32 Rice 1.563 121 

62-33 no name not vis. 87 

62-34 Goose 122.64 152 

62-35 no name 2.67 10 

62-36 Priebe 3.13 17 

62-37 Gem 16.24 20 

; 

62-38 Vadnais 568,52 477 


62-39 Twin 25.12 37 35 

62-40 Willow 30.47 ^75 

62-41^ 


no name 


8.02 


15 


Acres 
Depth ^ 
10 feet 

cflM:ed* 

Acres 
Depth S: 
10 feet 

Type 

Comments 



III 

eliminated by air photo 



•III 

eliminated by air photo 



III 

eliminated by air photo 



IV 

eliminated by air photo 



VA 

US 61 goes through 



IV 

eliminated by air photo 



VA 

narrow; surrounded by 
houses now; used as 
neighborhood pond 



VA 

marsh at SE end, basin 
slightly larger than 
lake 



V 

marsh in SW & NE has wate 
on printout, also water 
separate from lake in 
marsh; road there too 
narrow to show; St. Paul 
Water 

25 

12.21 

V 

some marsh to SE & NS 



IV 

surrounded by marsh, basin 
(depress ion) much larger 
than water surface 



VA 

' surrounded by marsh; very 
shallow, some construc- 
tion one side & homes; 
basin large 















J 




N. 





Acres 

Acres 



Lake No. 

Name 

Counted 

Ac:^eage 

Basin 

Acreage 

Lake 

Acreage 

Depth s 
10 feet 

Depth s 
10 feet 

Type 

Comments 

62-42 

no name 

6.9L 

12 




VA 

basin slightly larger 

62-45 

Long 

4,42 

111 




III 

eliminated by air photo 

62-46 

Pleasant 

553.88 

585 

627 

369 

376.29 

V ] 

marsh at northwestern edg< 

62-48 

Bennett 

19.57 

41 




III 

surrounded by marsh; 
eliminated by air photo 

62-49 

Ian ton 

1 

16.28 

35 




IV 

surrounded by rushes, 
some open water, emergent 
veg. 

62-50 

Wilson 

1.56 

19 


1 


IV 

eliminated by air photo 

62-51 

Robinson 

3.78 

28 




II 

eliminated by air photo 

62-52 

Poplar 

1.56 

19 




IV 

eliminated by air photo 

62-53 

no name 

9.,13 

15 




VA 

exit ramp from route 36 
to route 51 cuts into; 
some marsh, golf course 
on cast, housing to north 

62-54 

McCorron 

62.20 

71 

70 

50 

44.40 

1 

V 

a little marsh in NW, 
basin slightly larger 

62-55 

Como 

63.36 

69 

.72 

18 

7.77 

VA 

basin slightly larger 
than water surface 

62-56 

Cwasso 

333.28 

360 

355 

101 

241.98 

V 

marsh at southern and 
western end; development 

62-57' 

Josephine 

112.20 

110 

115 

51 

69.93 

V 


62-58 

Little 

Johanna 

16.24 

18 




V 

surrounded by marsh 








Turtle 


418.46 


62-62 


Charley 


29.56 


62-63 


no name 


3.13 


Martha 


10.24 


Sunf ish 


7.36 


Round 


100.90 


Valent ine 


50.48 


Karth 


9.58 

I 


62-73 


Snail 


138.42 


62-74 


Grass 


1.56 


62-75 


Island 


44.29 


62 - 7 ^ 


Jones 


8.02 


28 


!lo\JTTC^d 
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Lake No. 

Name 

Counted 

Acreage 

Basin 

Acreage 

Lake 

Acreage 

Acres 
Depth s 
10 feet 

Acres 
Depth a 
10 feet 

Type 

Comments 

62-78 

J ohanna 

1 

189.49 

211 

225 

162 

143.19 

V 


62-79 

1 

no name 

6.46 

11 




VA 

pond 

62-80 

Emily 

5.80 

12 




VA 

highway along eastern 
length houses on western 

62-81 

Mud 

j 

9.13 

16 



i ■ 

VA 

surrounded on western 
half by houses & rushes 
on eastern 

62-82 

Wabasso 

36.67 

47 1 

46 

20 

17.76 

V 

surrounded by marsh (or 
rushes) about 50 yds. 
to open water 

19-79 

\ 

Pickerel 

94.94 

52 




IV 

steep side to SE (causes 
shadow marsh to S & NW 
toward river; RR through 
the top 

82-167 

White Bear 

635o|.363 

2383.57 

10965 

2410 

2479 

1924 

2065.71 

V 



For location of lakes consult map on page 9, 

1. number of points on computer printout in the range 0 through 5 multiplied by 1, 104N + ♦ 453L 

2. acreage of lake basin as indicated by Bull. 25, Div. of Water, Soils & Minerals, Dept, of Natural Resources, 
from 1949 data 

3. acreage by planimetry from Div. of Water, Soils, & Minerals maps ranging in age from lake to lake 

4. acreage by planimetry from Div, of Water, Soils, & Minerals maps ranging in age from lake to lake 

5. number of points on computer print out in the range 0 through 1 multiplied by 1. 104N + , 453L 

6. Lake type established by Metropolitan Inventory of Lakes, Dept, of Conservation, Fish & Game Commission 

\ II - grass on sedge meadows IV - deep marsh VA - marginal fish - game lake 

III - shallow marsh V - fish lake 

7. comments refer to field checking and air photo interpretation (1968 - Mark Hurd photos) done for Ramsey County 
Lake Surface Zoning Study and reference to topographic maps (USGS - various years) 
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One problem encountered in this study was that things other 
than water fell in the interval 0 through 5. Along with water 
were cloud shadows and heavily industrialized areas. It seems, 
however, that consideration of band 5, which enhances cultural 
features, will enable the elimination of those areas in the 
interval 0 through 5 which are not water. On band 5, the 
heavily industrialized areas seem to show up as much lower 
numbers than the lakes. Cloud shadows can be distinguished 
by associating them with clouds on band 5. By looking in the 
appropriate direction, given the time of day of the^ photo and 
the time of year, one encounters an area, somewhat smaller 
than the cloud shadow, of very high numbers, higher than any 
thus far encountered in this study. Thus, by scanning the 
photo, cloud shadows can be distinguished from water. Alter- 
natively, photos from different dates could be compared to 
detect the presence of clouds. 

Sandy Lake, 62-15, was not visable. It is likely that 
62-15 was nearly dry at tha time of the photo; it is a water 
supply lake, and therefore highly fluctuating. 

It seems that lakes down to a two acre size can be located 
by this technique. This is a much better performance than that 
obtained by a visual mapping project conducted by the Department 
of Geography of the University of Minnesota. They had approxi- 
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mate ly a ten acre threshold viewing size for small water bodies. 
They also noted difficulties in the discrimination of lakes and 
cloud shadows and of lakes and freshly plowed fields.* The 
latter difficulty was not encountered in this study due to the 
lack of agriculture in Ramsey County. 

In conclusion, it seems that the technique of determining 

« 

and monitoring lake acreage would provide adequate information 
for the various purposes discussed earlier. 

Numerous water areas not designated as lakes, perhaps due 
to their periodic nature, were noted on the computer print out. 
These corresponded to areas of water or swamp on the USGS topo- 
graphic maps. Since the appearance of lakes is a reflection 
of the ground water level, water fluctuation in a swamp area 
of known elevation could be used to calculate the water table 
fluctuation. This fluctuation may be due to natural causes 
or human intervention. This information, when combined with 
other information dealing with inflow and outflow from a lake 
by surface systems, would enable the determination of mass 
budgets for lakes. ^ ^ 

* "Application of ERTS-1 Imagery of State-Wide Land Information 
System in Minnesota ", ERTS-1 Application to Classification 
and Happing of Water Resources in Minnesota; ">wight A. Brown, 
Ralph Sanders, Jack Tlynn, and John Harrington, Department 
of Geography, University of Minnesota. 
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This acreage calculation method could also be valuable in 
the typing of lakes. A comparison of counted acreage with 
basin acreage and counted acreage from 6 through 9 could lead 
to a useful classification of lakes. Jack Flynn, working on 
the Ramsey County Lake Surface Zoning Study for the Minnesota 
Land Management Information Systems Study, felt that this 
computerized evaluation technique would be of great value to 
the classification of lakes. Some lakes, not eliminated by 
air photo, were discovered to be marsh on field checking. 

Others were discovered to be inaccessible due to being completely 
surrounded by marsh. A computerized evaluation using ERTS- 
data rather than outdated aerial photos could have- led to more 
efficient use of costly, time-consuming field work. 
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Trout Lake Area Study 

This area was chosen for analysis because of the availability of an 
ERTS cloud-free coverage and the availability of groimd truth. The ERTS 
frame used for the data source was 1075-16312, an October 6th overpass. 
The ground truth was obtained from aerial photos, county maps, quadrangle 
maps and a land use map produced by the Department of Geography of the 
University of Minnesota from ERTS photos. A line printer grey map of band 
7 was used to register the ERTS data with lakes and rivers on the ground 
truth map. The 4/24/69 aerial photo for the groimd truth map was taken 
by Mark Hurd Aerial Surveys, Inc, The University of Minnesota land use 
map was the principal source of grovind truth with the aerial photo being 
used as a verification of the uses assigned on that map. 

Land use, for this particular Iron Range study, was divided into nine 
classes: 


Class 


Name 

Number 

1 


Hardwood 

11 

2 


Conifer 

12 

3 


Water 

31 

4 


Water in mines 

32 

5 


Wetlands 

40 

6 


Mines 

61 

7 


Tailings 

62 

8 


Open 

70 


9 


Urban 


80 
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These classes represented all the distinct land uses within the area. 
Wlien a class sample was identified on the density level map, it was delin- 
eated and labeled. This procedure was continued until the number of samples 
within each class was sufficient to establish the class’ character. With the 
character of each class determined, automatic classification could proceed. 

In an effort to deci'ease the number of classes, the original nine 
classes were compressed to six in the following manner. The compression 
was done by using the minimum, maximum, mean, and standard deviation 
information as well as information about the land uses themselves. The 
nine classes were merged into six as follows; 

Feature New Class 

Hardwood (11) 1 

Conifer (12) 2 

Water (31) 3 

Water in mines (32) 3 

Wetlands (40) 4 

Mines (61) 4 

Tailings (62) 4 

Open (70) 5 

Urban (80) 6 

Wetlands were put with mines and tailings not only because their numerical 
values were similar; but, also, because their occurrence in this area seems 
to be mostly the result of mines and tailings (i. e., part of tailing ponds). 

The training set contained a total of 2855 samples with 543 in class 1, 
381 in class 2, 327 in class 3, 1256 in class 4, 214 in class 5, and 134 in 
class 6. In order to determine the separability of classes, the mean, stan- 
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dard deviation, jriirLLrriuiTi and niaxiin.uni were computed for each class, 
for each spectral band and for each of tlie six channels per band. 

The difference between the means for each class compared to the 
difference between means per channel is a measure of signal to noise ratio. 
This data, which is listed in detail in Table II, is plotted in Figure 2. The 
ratio of a (class )/ a (channel) indicates a relative signal to noise figure. 

If tliis ratio decreased to unity, it would indicate that the banding effect over- 
shadowed class differences and a need to normalize the channels. 

The training data used to train the automatic classifier; that is, a set 
of weights, were determined such that the training set was clustered around 
points equidistant from one another. When this set of weights is determined, 
an iteration is performed on the a priori probabilities to reduce the number 
of misses. 

Each iteration of the K-Class program provides the number of misses 
along with a confusion matrix by percent and by samples. The minimum num- 
ber of errors was obtained in the sixth iteration. An overall accuracy of 77. 6% 
was achieved (see Table III). 

Upon reaching the minimum number of mis classifications, the iterations 
are terminated and a list of the misses at that point are printed out. The list 
contains the ID number of the element, its assigned class, and the class de- 
cided upon by the K- Class program. A comparison of this information with 
the input data from the four bands and the minimum- maximum data for each 
class aided in the explanation of particular misses. 

Some misses arise from the erroneous designation of training samples. 

A comparison of the printout of sample location, designation, and data of each 
band and the maximum- minimum information with the density level map reveals 
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Table II, 

Probability Distribution of the Data as a Function of Channel 
Over the Four Spectral Bands 
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Table II {contiimed) 
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Table H (continued) 
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Table II (continued) 


■ CI.ASS c 32 ----- 4 -- Water in Mines 
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Table II (continued) 


CI.ASS 9 40 5 Wetlands (w/Tailing & Mines) 
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19/ 

MAX .3 

25/ 

mean 

s 

22 

* ^46/ 

ST.DV* 

s 

2 

• 1428 

band» 

= 5/ 

MIN.s 

13/ 

^:AX« = 

23/ 

MEAN 

= 

13 

.949/ 

ST.Dv. 

s 

3 

♦ 0963 

band* 

* 

M ! N . = 

12/ 

MAX . 3 

29/ 

MEAN 

B 

23 

.103/ 

ST.Dv 

5 

3 

.3034 

rA\D» 

= 7/ 

M J N ♦ a 

4/ 

MAX«s 

16/ 

MEAN 

C 

11 

*538/ 

ST .DV* 

c 

2 

• 2854 

channel s 

3 













band. 

a 4/ 

M I N . a 

Ifc/ 

MAX . , 

33/ 

mean 

2 

22 

*632/ 

ST.DV 

s 

2 

♦ 7355 

BANOi, 

a 5i 

H I N . a 

14/ 

MAX ♦ a 

3C'/ 

MEAN 

:: 

19 

• 395/ 

ST i D V * 

5 

3 

•3337 

fjAN’D, 

s 6/ 

M ! N. a 

iB/ 

MAX . a 

29/ 

MEAN 

s 

23 

•105/ 

ST.DV. 

s 

2 

.9b39 

euno* 

a 7f 

M I N . S 

fc/ 

MAX * a 

15/ 

ML AN 

S 

12 

•105/ 

ST.Dv 

- 

1 

.7739 

channel « 

4 

f ^ 












f)AND* 

a 4/ 

Ni f N . = 

20/ 

MAX.s 

25/ 

MEAN 

3 

23 

*02'*/ 

ST.Dv. 

s 

1 

• 6303 

p.anD* 

s 5/ 

M ! N ^ a 

{ it t 

MAX . a 

2" < 

MT AN 


i O 

. p A 4 . 

CT , l>u . 


n 

, < T *a 

« 1.. i « 1- 

hand* 

s A/ 


1 / / 

MAX . a 

22/ 

MEAN 


23 

♦ 317/ 

ST.Dv. 

= 

2 

.8149 

sand* 

= 7/ 

H I N . = 

b/ 

MAX *r 

16/ 

mean 

•X 

12 

• 634/ 

ST . DV • 

5 

2 

• 0572 

channel s 

5 













band. 

s kf 


18/ 

max .3 

27/ 

MEAN 

s 

22 

.882/ 

ST.Dv 

S 

1 

.7784 

B AND, 

C 5/ 

1^ } N) , 3 

15> 

MAX .3 

25/ 

mean 

3 

19 

♦353/ 

ST.Dv 

X 

1 

.9385 

band. 

= 6-» 

, J N . S 

1 4/ 

MAX .a 

30/ 

mean 

X 

24 

.176/ 

ST.DV. 

S 

4 

.1761 

. BAND* 

= 7/ 

M J N • a 

4 / 

Max. a 

16/ 

mean 

X 

11 

• 676/ 

St*OV . 

3 

2 

• 6977 

channel a 

6 




• 









band. 

a 4i 

MIN. a 

20/ 

MAX .a 

27/ 

mean 

s 

23 

.730, 

ST.Dv. 

S 

1 

.9952 

e^and* 

a 5# 

M I N , a 

lb/ 

.MAX . a 

29/ 

me. AN 

= 

21 

•027/ 

ST 1 D V . 

s 

2 

• 9727 

band* 

a &* 

"* 1 N . a 

13/ 

MAX ,a 

29/ 

mean 

r 

23 

.56-3/ 

ST.DV. 

= 

3 

•7166 

, BAND* 

s 7* 

M;N.= 

6/ 

M aX • a 

15/ 

MEAN 

• 

11 

• 784/ 

ST.Dv* 

= 

2 

•2317 
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Table II (continued) 


f* ^ 

CLASS ; 

61 ^ 

« M « ^ 

6 — 

Mines 







CHAN\'P.t s 
tiASiO. 

!'•■ 
= h t 

MIN.=' 

n^K.e 

^';AX • c 

2S/ 

mean 


24 

.507/ 

ST.DV. 


1 

.7037 

oanD# 

= 5/ 

M I N , a 

\bf 

MAX . s 

3c/ 

MEAN 

S 

23 

» C 4 8 / 

ST .DV» 

C 

2 

•9285 

band* 

* 6^ 

MIN.* 

Itf 

MAX. a 

25/ 

MEAN 

r 

23 

.000/ 

Si.Dv. 


3 

.7352 

band. 

« 7 / 

MIN.* 

bf 

MAX. a 

16/ 

MEAN 

= 

10 

•157/ 

ST.DV. 

- 

2 

• 1368 

channel = 
band* 

2 

S 4/ 

M I N . 5 

20/ 

MAX • ' 

22/ 

MEAN 

B 

24 

.922/ 

ST.DV 

a 

1 

• 4524 

band* 

= 5 .• 

M I M ♦ » 

14/ 

KAX.S 

22/ 

MEAN 

r 

23 

.159/ 

ST.DV* 

- 

2 

•5511 

(3 ANiD Ir 

= 6^ 

MIN.* 


MAX .3 

229 

mean 

z 


• l56» 

ST.DV. 

s 

3 

.8839 

pAND* 

= 7> 

M I N . = 

^9 

max . r 

16/ 

mean 

• 

9 

• 34^ i 

ST.DV* 

s 

2 

•5029 

channel s 

ti A ND* 

3 

S 4> 

M 1 > > . = 

20/ 

MAX .s 

25/ 

MEAN 

c 

24 

•530/ 

ST.DV* 

** 

1 

.9096 

band* 

= 5* 

min.= 


A X ♦ 5 

30/ 

MEAN 

: 

23 

•3.61/ 

ST.Ov 

a 

2 

• 4768 

band* 

« i' 9 

H I N , » 

lb* 

MAX.a 

31/ 

MEAN 

= 

23 

•169/ 

ST t D V • 

B 

3 

• 2032 

band* 

? 7/ 

M J N . s 

6/ 

MAX.s 

if/ 

MEAN 


o 

•482/ 

SjiDV. 

S 

1 

.8971 

channel = 
band* 

4 ' 

* 4, 

M I N ♦ a 

21/ 

MAX.s 

22/ 

MEAN 

B 

24 

•212/ 

ST » D V • 


1 

.5305 

band* 

? 5/ 

min.= 

14/ 

MAX . a 

27/ 

mean 

S 

22 

♦ 787/ 

3 T . D V • 

s 

2 

• 5870 

band. 

= (if 

M I N . = 

14/ 

MAX .a 

3C/ 

MEAN 

S 

23 

• 200/ 

ST t D V • 

s 

3 

. 3667 

band* 

» 7/ 

M I N . = 

r 

b* 

MAX .a 

16/ 

mean 


10 

•000/ 

ST ♦ DV • 


2 

.3611 

channel » 

band* 

5 

= 4/ 

MIN. a 

ci* 

MAX ♦ c 

3?/ 

mean 

- 

23 

.772/ 

ST.DV 


1 

• 4073 

K ANDv 

• « 

M J k; . » 

4 i # 

U * V . _ 

' 

MEAN 


r." u. 

•ITA, 

I-.- T P* . . 

1.^ « • i>- V ^ 

- 


♦ <3 o / o 

band* 

= f; f 

M i N' . a 

13^ 

MAX . a 

34/ 

mean 


23 

.674/ 

ST.DV. 

r 

3 

.4801 

band* 

= If 

M I M . 6 

H 9 

max .= 

15/ 

MEAN 

- 

o 

•620/ 

ST.DV 

s 

2 

• 0999 

channel 5 

B A N D * 

6 , 
s 4i 

MIN.* 

2c 9 

MAX . a 

32/ 

MEAN 


P4 

•351/ 

ST.Dv 


1 

• 8725 

band* 

s 5/ 

MINI.* 

1^9 

MAX * a 

32/ 

mean 


23 

•103/ 

ST.DV. 


2 

• 59' 0 

band* 

= (if 

M ! N , a 

139 

r'AX.a 

36/ 

MEAN 

= 

23 

.713. 

ST.Dv. 


3 

.7475 

£AND* 

» 7/ 

M I N . * 

b9 

MAX . a 

17/ 

mean 

- 

10 

.368/ 

ST .OV. 

= 

2 

.3445 





-35- 


Table II (continued) 


----- CU>SS ; 62 ----- 7 -- Tailings 


CHA^;^JEL = 1-- 



II 

M I N , a ' 

120/ 

MAX,= 

32/ 

MtlAN 

1 



ST 

*DV • 


2 

*5S32 

rAND» 

= 5i 

M I N , a 

'16/ 

MAX *a 

35/ 

MEAN 

= 

23 

♦ 6 1 1 

ST 

*DV * 


4 

*8664 

ciANib* 


M I N . s 

lb4 

MAX.a 

34/ 

MEAN 

= 

25 


ST 

* D V • 

= 

3 

• 8035 

ban^» 

= 7f 

MIN*’ 

7j 

MAX, a 

15/ 

mean 

- 

11 

• 354/ 

ST 

*0v » 

fS 

1 

*9196 

CHAN,\EU = 

2 



*■ 











bamd* 

- 4/ 

MlN.a 

20/ 

MAX.r 

31/ 

mean 

U 

2A 

*539/ 

ST 

.DV • 

X 

2 

• 3178 

band* 

= 5* 

1 N . = 

16i 

MAX .a 

34i 

mean 


?? 

*551/ 

ST 

• DV ♦ 

= 

4 

•3005 

band* 

= 6j 

M 1 N . s 

17* 

MAX.S 

35/ 

MEAN 

s 

24 

*640/ 

ST 

* D V » 

s 

3 

•5450 

band* 

? 7j 

M I N ♦ * 

6^ 

MAX,= 

U/ 

MEAN 

e 

10 

*787/ 

ST 

• DV* 

= 

2 

*1799 

channel s 

3 

... ... 













band* 

= 4, 

MIN.= 

20/ 

K ^ X a S5 

35/ 

mean 

s 

23 

.845/ 

ST 

• DV * 


3 

•2385 

band* 

S 5/ 

min, a 

15/ 

MAX, a 

34i 

M L A fN 

a 

2l 

.976/ 

37 

* D V * 

z 

3 

• 7606 

band* 


MIN.= 

17/ 

MAX. a 

31/ 

mean 

s 

2.3 

* 8 A 9 / 

ST 

• DV* 

3 

2 

*9672 

band* 

= 7j 

M I N , » 

6i 

MAX *a 

16/ 

MEAN 

s 

10 

.905/ 

ST 

• DV* 

= 

2 

*3331 

channel * 

4 














B a.ND*. 

= 4/ 

M I N . = 

20/ 

M A X , c 

37/ 

mean 

9 

24 

*465/ 

ST 

• OV* 


3 

• 0220 

' band. 

= 5* 

r N . « 

15/ 

MAX »a 

38/ 

MEAN 

r 

23 

♦?o9/ 

ST 

• DV* 

= 

5 

*2637 

rand* 

= 

MT N . « 

17/ 

MAX ,r 

38/ 

MEAN 

r 

25 

*035/ 

ST 

*DV* 

s 

4 

• 1469 

BAnD* 

a 7j 

MIN** 

5/ 

MAX ♦ a 

16/ 

mean 

= 

11 

• 453/ 

ST 

*Dv • 

3 

2 

•2447 

channel » 

5 

•4 













BANDj, 

= 4* 

M I N . a 

15/ 

MAX .a 

25/ 

mean 

s 

23 

*207/ 

ST 

*DV* 

s 

1 

*8329 

rAND* 

a fi* 

M T , « 

1 6/ 

MAX.S 


MP AM 

= 

o < 

.7Q'3. 

Sf 

. n >/ . 


4 


band* 

a ^ / 

It 

• 

4— 

!-♦/ 

MAX • s 

34/ 

mean 

s 

24 

• 770/ 

ST 

• Dv • 

X 

4 

•0278 

land* 

= 7/ 

Min, a 

5/ 

MAX ,a 

15/ 

mean 

3 

10 

.770/ 

ST 

*0v* 

s 

2 

•0550 

channel s 

6 














band* 

s 4/ 

M I M , a 

cO' 

MAX, a 

3?/ 

mean 

s 

24 

.900/ 

ST 

*D7* 

3 

2 

*6140 

band* 

a b* 

M I N . a 

lb/ 

Ml AX , c 

35/ 

MEAN 

s 

23 

•571/ 

ST 

» Dv * 

= 

4 

*8843 

band* 

= 6i 

M’N.a 

lb' 

MAX .a 

35* 

MEAN 

= 

25 

.386/ 

ST 

*DV* 

3 

3 

*9939 

BAND* 

» 7/ 

MIN, a 

6/ 

MaX , 3 

16/ 

MEAN 

S 

li 

*429/ 

ST 

• Dv* 

- 

2 

• 0603 
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Table II (continued) 


class ; 70 - — - g 


channel = 1 


band. 

* 4# 

M I N . a 

20/ 

MAX.s 

27/ 

mean 

s 

?4 

»237/ 

5T 

• Oy . 

n 

* 

1 

• 3272 

(3 A N D , 

s 5/ 

N I N , » 

1 

MAX .a 

27/ 

mean 

7 

18 

• 1'5S / 

5T 

*0v 

2 

c 

• 690^ 

AND^^ 

= A# 

NIN.a 


MAX.r 

AB* 

MEAN 

r. 

37 

* 60b/ 

ST 

• DV» 

3 

3 

•9435 

band* 

= 7/ 

K T N . = 

loi 

MAX.c 

31/ 

ME-an 

» 

23 

♦ 526 / 

ST 

• Dv 

B 

3 

• 0927 

channel = 

2 


• 



..... 









band. 

a 4 , 

MN.a 

20/ 

MAX.c 

28/ 

mean 

s 

24 

•367/ 

ST 

• Dv' • 

X 

1 

• 7693 

band* 

= 5^ 

M1N,= 

14/ 

max .5 

26/ 

mean 

s 

18 

.959/ 

ST 

• D'v 

3 

3 

• 0968 

BANDk 

= h* 

M I N » = 

2D/ 

MAX.s 

46 / 

MEAN 

7 

37 

•6 33/ 

ST 

• Dy . 

X 

4 

• 6585 

SAND, 

= 7, 

MIN.* 

17/ 

. 5 

30/ 

mEan 

= 

22 

•755/ 

ST 

• Dv* 

3 

3 

•0741 

channel 5 

3 

1 













band. 

« 4 / 

MjN.s 

22 / 

MAX .= 

22 / 

mean 

:z 

23 

• 680/ 

ST 

• Dv • 

3 

1 

• 7826 

B AnD, 


min.* 

14/ 

MAX.s 

23/ 

me: an 

IS 

18 

•100/ 

ST 

• Dy . 

S 

2 

• 220 A 

band. 


M I N . = 

3l / 

Ml A X . 3 

/ 

MEAN 

5 

38 

• 080/ 

ST 

. 0 V . 

s 

4 

• 0440 

band* 

• 7# 

M I N . « 

17/ 

MAX .= 

28/ 

mean 

s 

22 

• 840/ 

ST 

• Dy • 

3 

2 

♦ 7376 

channel s 

4 














band, 

S 4 / 

MIN.= 

21/ 

MAX.s 

27/ 

mean 

5: 

23 

• 839/ 

fS 

f 

• Dv* 

% 

1 

• 8854 

■ band* 

- 5/ 

min.= ■ 

lb/ 

MAX . E 

26/ 

mean 

c 

18 

• 5 4 S / 

ST 

• Dv ♦ 

s 

3 

•3199 

band. 

» 6 ^ 

M I N , = 

28/ 

MAX . a 

44/ 

MEAN 

r 

37 

•645/ 

ST 

. 0 V . 

S 

4 

•6458 

band# 

? 7/ 

M J N . * 

14/ 

MAX.s 

27/ 

mean 

s 

22 

•371/ 

ST 

* Dy • 

B 

3 

• 2397 

channel c 

5 

1 













band. 

5 4/ 

MIN. a 

19/ 

MAX.s 

25/ 

MEAN 

s 

23 

.P67/ 

ST 

• Dy. 

3 

1 

• 5261 

band. 

- 

r ’ r.! . a 

16/ 

MAX.s 

26/ 

mean 

=. 

13 

•933/ 

ST 

. 0 V • 

3 

2 

♦5421 

p.and. 

- 6> 

^IN.* 

30/ 

MAX . s 

5C/ 

mean 

s 

. 3.8 

♦ 46?/ 

ST 

»Dy« 

3 

4 

• 9648 

band. 

r 7i 

M I . s 

19/ 

MAX . s 

26/ 

mean 

s 

22 

•533/ 

ST 

♦ ov» 

3 

2 

♦ 1250 

channel 5 

6 














band. 

s 4/ 

min.s 

21/ 

MAX ,= 

27/ 

mean 

s 

23 

• 645/ 

ST 

• Cy • 

3 

2 

• 0407 

tjAND# 

a 

MTM.s 

13/ 

MAX . 3 

26/ 

MEAN 

s 

18 

• 065/ 

ST 

♦ Dy , 

B 

? 

• 8i63 

band. 

=» 

KIN.* 

?is/ 

MAX.s 

;d/ 

mean 

r 

38 

•806/ 

ST 

tOy. 

3 

4 

•8952 

band. 

- 7/ 

MIN', a 

17/ 

MAX .3 

30/ 

mean 

B 

2? 

• 290/ 

St 

• Dy . 

3 

3 

• 1335 


beproducibility of the 
Snae page is poor 



-37- 


Table n (concluded) 


----- CL/-SS ; 80 9 — Urban 


ChANNEt = 






= 4 ^ 

M ! N , s 


H.AX .3 

BAMD, 

• 

V I V , s 

18^ 

MAX * s 

:jAVr, 

= 6^ 



VAX .= 


3 7 i 

M T \| . s 

13* 

MAX.c 

C H A N Si E L - 

2 


■ 


bano. 

3 4# 

M N . = 


MAX.s 

band» 

* f: i 

H 1 N , - 


MAX.i 

mAND* 

= 6/ 

M IM," 


MAX ♦ = 

5ANP» 

3 7 0 

M 1 N , = 

1<> * 

MAX* = 

CHANvEU e’ 

3 


' 


band. 

^ ^ i 


E6 ^ 

MAX • s 

band* 

s b/ 

M ; N . s 


MAX » = 

band* 

- A/ 

M P! t = 


MAX . = 

band* 

- 70 

M I N . 

V3^ 

HaX .= 

channel * 

4 




band, 


H I N . * 


MAX ♦ = 

band* 

s bp 

H 1 V . = 


M AX *s 

band* 

« 

K 1 N r s 

t' £ •' 

M A X » a 

band* 

« 7/ 

M J N . s 

13/ 

M.aX • c 

channel = 

o 




u ANDjj 

^ 4 / 

M 1 N . = 

25/ 

f-^A X ♦ s 

. . • % - r\ 

t % p- ^ 

* ^ p 

► f 1 '*! • " 

' 

<*;t« A ♦ r 

BAN'O* 

r A/ 

M I N . r 

27/ 

MAX . s 

BAnD* 

f 7i 

H I N * » 

U/ 

MAX. 3 

channel s 

6 




R AND* 

- 4 / 

KIN.s 

2b/ 

MAX. 8 

band* 

= B/ 

H p.; . s 

22/ 

MAX .= 

band* 

= A i 

H I N . t 

?£>/ 

MAX .= 

B AnD* 

= 7/ 

M j • * 

11/ 

MAX .= 


*STep» OooOOOCj 


31/ 

Mr. AN E 

26*970/ 

3T.D'/. 


27/ 

MEAN <f 

r 

3 ’1’ /a D V' * 

» 1*5230 

3'f/ 

mean a 

31.051/ 

ST • 0 V • 

= l.yO'M? 

19/ 

mean = 

16»5l5/ 

ST*DV 

= 1«A53C 


cS/ 

mean s 

26*3?0» 

ST*DV. 

= 1 

• 2238 

26/ 

mean = 

22 * 080/ 

G T . D V • 

s 1 

• 5728 

39/ 

MEAN 8 

31 *620/ 

ST » D V » 

» 2 

•7701 

22/ 

MEAN = 

16*320/ 

ST.Dv* 

= 1 

• 7600 


33/ 

MEAN s 

23*000/ 

ST.Dv. 

s l»9i‘f9 

27/ 

mean s 

23*333/ 

ST .Dv 

8 5,4944 

3S/ 

MEAN a 

31*750/ 

ST ,Dv . 

= 2.9150 

20/ 

mean s 

16*583/ 

ST .Dv ♦ 

= 2*2358 


32/ 

MEAN = 

28*000/ 

ST ♦ D V . 

* 1 

f 4 60^ 

3C/ 

mean a 

24. Aqc). 

ST » D V * 

= 1 

•S^933 

38/ 

mean = 

33* 133/' 

ST.Dv* 

a 3 

* d c: A t; 

-.0 •hJ w 

20/ 

mean s 

17-600/ 

S T * E> V • 

= 2 

*0591 


29/ 

me. AN = 

27*000/ 

ST.Dv. 

= 1.297S 

26/ 

I'i E * N s 

2 3 • 6 j H / 

D r « u V » 

= 1*9252 

38/ 

MEAN a 

33.316/ 

ST.DV« 

5 3o2453 

20/ 

mean = 

i6‘4aii 

5 T * D V « 

= 2*3241 


32/ 

MEAN s 

27.700/ 

ST.Dv* 

» 1 

» 320 4 

29/ 

MEAN * 

2 ^ • 533/ 

ST.Dv. 

= 2 

* 04 50 

36/ 

MEAN n 

31 *S33/ 

ST » D V . 

3 0 

*4325 

21/ 

mean = 

!6‘ 167/ 

ST.Dv* 

=* 2 

,.5939 
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‘ Figure 4. 



Class and channel Dispersion 



7^6,1 

\ 

1 4 = 23.07 

CLASSES 1-9 

3 2,4 5 8/ 9 

1 1 ,) 1 II 1 

i 


1 III ^ 


CHANNELS 1-6 


= 0.105 

channel 

! 

3 

1_ 

4 2 8 5 7, 6,9,4 

— U u LU 

4 = 19.2 

= 19.34 

• class 


CT = 0.068 

channel 


00 

I 


3^4 2 5;5 7 1 9 8 

t I ..... — n 


6 — 




II 1 1 

j , = 77.69 

1 1 class 





flit 

= 0.077 

channel 


3 4 

6 7 5^ 

9 1 

8 

4 = 12.11 

7 — 

_U 

LI1_ 

1 1 

^ 

''class = 39-09 
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Table III. 

Confusion Matrix for the Trout Lake Area 


Confusion Matrix in Percent; 

Classifier Assignment 




. .. 1/ 

2/ , „ 

• 3/ - 

- 4/ 

5 / 


Hardwood 

1/ • 

- 77.3 

. .4 • 

• 0 

4.6 

9.6 

8.1 

Conifer 

2/ 

10*0 

80*3 

3*1 

5.8 

.3 

1 »6 

Water 

3/ 

•0 

3.1 

95.4 

1‘5 

.0 

*0 

Mines 

4/ 

8*6 

9*6 

4.2 

70*1 

• 3 

7.2 

Open 

5/ 

7*5 

.5 

*0 

«0. 

S7.9 

4.2 

Urban 

6/ 

5.2 

.0 - 

• 0 

11*9 

3.0 

79.9 


Confusion Matrix in Samples; 


4?q . 2 0 - • 25 - 5? 44 - Overall accuracy; 

-3.4 - 3o6 S' - 6 

0 10 31 a 5 0 0 

IOr lei 53 8S0 4 90 

16 1 0 0 188 9 

7 0 0 16 4 107 


2855-642 ^ 


2855 


77.6% 
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many of these erroneous designations. Often the values within a data block 
were very uniform except for one or two points on the periphery. These 
peripheral points usually fell on or very near the minimum- maximum sheets. 
Upon checking the location of these points on the density level map. it was, 
in most cases, found that they were not actually part of the class with which 
•fliey had been grouped, but were part of a neighboring class. 

The inclusion of these peripheral points in the wrong class caused an alter- 
ation of the minimum- maximum values of that class. That is to say, the in- 
clusion of some hardwoods with open would cause the distinctions between the 
two classes, expressed by the minimum-maximum values for each channel to 
be blurred. Thus, numerous occasions would arise when an element could be 
classified in one of two ways. A correction of these erroneous training samples 
would eliminate some of the extremes in classes (i. e., lower the maximum and 
raise the minimum); therefore, further separating the classes and allowing 
the K- Class program to operate more effectively. 

hi two instances, the wrong column or row number was punched onto 
the computer cards. Thus a whole training set data block was erroneously 
classed. Correction of this error would have the same effect as the correc- 
tion of the previous type; elimination of extreme values in classes. 

The misses thus far discussed have been human errors. Others 
occurred because of the overlap between classes. For two classes to be 
distinguished from one another 100 percent of the time, none of their values 
may overlap; i.e,, the maximum of one must be less than the minimum of 
the other. This is generally not the case. 
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Por exan,ple, the values for open and conifer overlap in 
bands 4 and 5 but not in 6 and 7. conifer and urban overlap 
in bands 5 and 7, but not in 4 and 6. The two classes of water . 
and conifer overlap in bands 4 and 5, but not in 6 and 7. Natural 
water can be distinguished from water in mines by band 4 only. 

This is an important distinction,- because, natural waters 
(lakes, rivers, reservoirs) can be used for recreational 
purposes, while water in mines can not. Hence, when taking a 
total of all useable waters in an area, water in mines should 
not be included. Hardwoods can be distinguished from tailings, 
mines, and wetlands by band 7 only, whereas hardwoods and conifers 
can be distinguished by 6 and 7, they cannot by 4 and 5. m 
over half of the misses that occurred the value of at least 
one of the bands did not fit within the minimum-maximum values 


for the class decision. 

Some classes, however, overlap at least to some degree , in 
all of the four bands. Mines and tailings overlap, but this is 
not of extreme importance because they are both in the class 
extractive. Hardwood and open coropletely overlap, as do hard- 
wood and urban. in the case of the former, however, their means 
are different enough to suggest that the elimination of the 
erroneous peripheral data elements mentioned earlier may cause 
their values to separate. This seems much less likely in the 
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case of the latter. Open and urban also overlap on all four bands. 
Their overlaps are slight enough, especially in band 4, that 
they could at least be lessened by eliminating erroneous data. ^ 

A comparison was made of the data from the Trout Lake area 
with that of Pineville, located northwest on the same photo. 

The two areas were analyzed using a different method for deriv- 
ing the ground truth and therefore the training samples. The various 
classes were compared using the minimum-maximum sheets of the 
two areas. . • 

Values for hardwoods in the two areas matched in bands 6 
and 7 with the mean for Pineville being slightly higher, but not 
in bands 4 and 5, where the Trout Lake means are higher. Conifers 
matched except for band 5 with, again. Trout Lake's mean being 
higher. Urban matched well in all four bands. Open matched in- 
band 6, but only overlaps in bands 4, 5, and 7 with Trout Lake’s 
mean consistently higher. No comparison could be made of extrac- 
tive due to the differences in the extractive classes used (ex- 
tractive 1, 2, 3 and 4 in the Pineville study verses mines, 
tailings, and water in mines in the Trout Lake study.) 

These differences may have occurred due to the relative 
locations of the areas on the photo or more likely, they are 
due to differences in ground truth interpretation, since the two 
studies match in such easy to identify classes as urban. 
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The ground truth for Pineville was delineated with high altitude and regular 
aerial photos, while Trout Lake was delineated using landuse interpretations 
from ERTS photos with aerial photos as a backup. In the latter case, quali- 
fied land use interpreters had already decided what could or could not be desig- 
nated from satellite photos. 

Since ground truth for the two areas from photo 1075-16312 was not 
obtained in the same manner, ' the comparison of the results of the two areas 
cannot be considered conclusive. In the next experiment to be described, two 
areas (Pineville and Virginia) on the same photo were analyzed using the same 
procedure for obtaining the ground truth. This study was performed to deter- 
mine the transferrability of class interval designations from one part of an 
ERTS image to another. 

Pineville Virginia Area 

The next ERTS image analyzed was 1057-16311, a September 18th cover- 
age over Virginia and Pineville in northern Minnesota. These two cities are 
approximately 20 miles apart. In this experiment the training set was obtained 
over Pineville and the training weights obtained were applied to the Virginia 
area. This particular image was cloud-free but contained banding on MSS 4 
and MSS 6. 

One of the six channels on band 7 (channel 2) was attenuated with respect 
to tlie other five. This produced an image with repetitive bands spaced six 
scans apart. A signal of this tjrpe and its Fourier Transform is shown in 
Figure 5. The Fourier Transform computed from the ERTS data in which 
every sixth line is attenuated is shown in Figure 6. This spectrum was com- 
puted from word 200 of 256 scan lines; that is, from one column of the image 
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Figure 5. 
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PINEVILLE (1057-16311) 
BAND 4 


Figure 6. Power Spectral Density 
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matrix. A direct comparison can be made between tlie spectrum from band 
4 in Figure 6 to the spectrum of band 5 on Figure 7. Both have the same 
vei1;ical scale. Since band 5 does not have banding (all 6 channels have equal 
gain), the two spikes at 43 and 86 are absent. ' 

Band 6 had one channel (channel 5) with intermittent drop-outs provid- 
ing a broad spectrum of noise, as shown in Figure 8. This curve can be 
compared with band 7 in Figure 9, a band having no drop-outs. As expected, 
power spectrum on Figure 9 is much lower. 

Because of the banding problem, three classifier training sets were de- 
rived; that is, classification weights were determined for channels having 
consistent gain 1, 3, 4, 6. Separate weights were obtained for channel 2 be- 
cause of the attenuation on band 4. Separate weights were also obtained for 
channel 5 because of its sporadic dropout on band 6. Thus one set of weights 
were used on scan lines 1, 3, 4, 6, 7, 9, 10, 12, etc. Another set of weights 
were used on scan lines 2, 8, 14, 20, etc. A third set of weights were used 
for scan lines 5, 11, 17, 23, etc. 

The classification accuracy for the three is approximately equal. The'* 
percentage confusion matrix for six classes based on four channels 1, 3, 4, 6 
of data is shown in Table IV . 
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Figure 8. Power Spectral Density 
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Pineville Confusion Matrix in Percent 


>v,^^Classif ier 
^XAssign. 

Ground truth\. 

Hdwd 

Con 

Res 

Open 

Mine 

Tailings 

Hardwood 

89.9 

4.3 

.6 

5.2 

0 

0 

Conifer 

3.0 

89.7 

0 

.4 

.4 

6.5 

Residential 

0 

0 

95.7 

3.2 

1.1 

0 

Open 

33.7 

1.2 

9.3 

55.8 

0 

0 

Open Pit Mine 

0 

.9 

6.5 

.9 

89.8 

1.9 . 

Tailings 

0 

7.5 

0 

0 

2.5 

89.9 


The overall accuracy for the data from these four channels was 87, 5%. 
The overall accuracy is computed by summing the samples on the diagonal of 
the sample confusion matrix shown in Table V and dividing this sum by the 
total of all samples. 





Table V 


Pineville Confusion Matrix, Samples 


classifier 

^"^Assign. 

Ground Trutm'''v^ 

H 

c 

R 

0 

M 

T 

Hardwood 

.2 95 

14 

2 

17 

0 

0 

Conifer 

7 

208 

0 

1 

1 

15 

Residential 

0 

0 

89 

3 

1 

, 0 

Open 

29 

1 

8 

48 

0 

0 

Open Pit Mine 


1 

7 

1 

97 

2 

Tailings 

1 

0 

12 

0 

0 

4 

143 


The classification accuracy based on the data from channel 2 was 89.7%, 
The classification accuracy computed from channel 5 was 86.5%. One would 
expect this to be lowest because of the sporadic dropout. One would also 
expect that the accuracy from a single channel of data would be higher than 
from a combination of channels having slight variations in gain. This is 
evidenced by the performance on channel 2, where the accuracy is highest 
even though the signal is attenuated. 

The thematic map generated for the Pineville area is shown in Figure 
10. The classes in going from light to dark are Open, Conifer, Hardwood, 
Urban, Tailings, Open Pit Mines, and Water. 
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FIGURE 10 PINEVILLE THEMATIC MAP 
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This thematic map was generated using a sequential test to separate 
out water and tailings. Water was isolated by those values less tlian 5 on 
band 7. Tailings were determined by those values greater than 27 on band 
5. The other six classes were classified with K-Class based on all four MSS 
bands. Open pit mines appeared to have a bi-modal distribution and were 
broken down into two classes. A block diagram of the classification process 
is shown in Figure 11. ’ 

A comparison can be made between the computer generated thematic 
map in Figure 10 and the ground truth map shown in Figure 12. The two maps 
are very similar, indicating that most of the classes are detected accurately. 
As shown on the thematic map, the class "Open" appears to have the greatest 
number of errors. The scattered single "Open" indications in the thematic 
map in Figure 10 may actually be isolated Open areas in the Forested region. 
One does, however, expect a larger error in this class, as seen in the con- 
fusion matrix in Table IV . One reason for the large error in this class may 
be because of the relatively small number of training samples, r- 

Data from channel 7 of 1129-16320, a November 29, 1972 coverage, was 
added to the September coverage in an effort to improve classification accuracy. 
With five bands of multispectral data, a classification accuracy of 89% was 
obtained for the four channels 1, 3, 4, 6. This compares with 87.5% 
obtained with the single September coverage only. 

The next experiment performed was the application of the weights ob- 
tained over Pineville to the Virginia area. The three sets of weights were 
used for tlie different channels; that is, a set for scan lines 1, 3, 4, 6, a 
set for scan line 2 and a set for scan line 5. Figure 13 shows the thematic 
map generated over Virginia based on the Pineville weights. The confusion 
matrix for Virginia was derived by selecting random samples from the image 
and comparing the actual class of that point with the classifier assignment. ' 

The confusion matrix obtained in this manner is shown in Table VI 
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Figure 11. Sequential Classification Procedure 
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FIGURE 13 VIRGINIA THEMATIC MAP 



Table VI. 

Virginia Confusion Matrix 


classifier 

^S^ssign. 

Ground truth\. 

F 

R 

0 

M 

W 

T 

Forest 

3745 

26 

. 31 

6 

1 

6 

Resident ial 

84 

155 

94 

31 

1 

39 

Open 

376 

30 

484 

15 

0 

16 

Open Pit Mine 

42 

9 

17 

203 

2 

85 

Water 

11 

0 

1 

1 

138 

5 

Tailings 

21 

22 

10 

57 

5 

198 


The overall classification accuracy of 82.4% is 5. 2% less than the 
overall classification accuracy over Pineville. This indicates that perform- 
ance does not drop appreciably when the weights obtained from a training set= ' 
are used for a test set which may be an area 20 miles away. A second eval- T 
nation of the accuracy of classification can be made by comparing the computer 
generated thematic map in Figure 13 with the ground truth map in Figure 12. 

Ramsey County Metropolitan Area 

The final area considered in this study consisted of Ramsey County, 
which contains the city of St. Paul, Minnesota. Data for this area was ob- 
tained from the ERTS coverage 1075-16321, an October 6th, 1972 cloud-free 
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overpass, The four multispectral bands were used for features in perform- 
ing automatic classification. In addition, textural features were used from 
bands 5 and 7. Severe banding was experienced on Band 4 because channel 2 
provided a very low signal output. To accomodate this anamoly, two sets of 
classification weights were utilized; one set was derived from all four bands 
but excluded channel 2. The second set used channel 2 from bands 5, 6 and 7 
to fill in the missing data. 

Three experiments were performed as follows; 

1. Using multispectral data only. 

2. Using multispectral plus texture from band 7. 

3. Using multispectral plus texture from bands 5 and 7. 

Four classes were delineated- -Water, Residential, Open and Urban, 
These classes were selected on the basis of availability of training data. The 
two classes Agriculture and Forest are included in the Open class, since very 
few samples of this type are available in Ramsey County, 

The multispectral features consisted of the four MSS bands (or three 
for channel 2), Texture was added to these features. Texture was obtained 
from the Slant Transform coefficients obtained from a 4 x 4 array of data 
points. When only multispectral data is used, the classification accuracy is 
about 94%. This performance is obtained using the same data set for training 
and testing. By adding texture, the performance increased to 100%; however, 
this again is based on the same training set and test set. 

A more realistic evaluation can be obtained by comparing the thematic 
maps generated by the various procedures. The thematic map obtained using 
multispectral data alone is shown in Figure 14. The four classes Urban, 
Water, Open and Residential are plotted from dark to light. 




FIGURE 14 RAMSEY COUNTY THEMATIC MAP US ING MSS DATA ONLY 
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When adding texture from band 7, the thematic map shown in Figure 15 
was obtained. This map was made by making class assignments on individual 
pixels; however, the texture portion of the feature was constant over the 
4x4 array. This accounts for the block like appearance of the thematic 
map. Most of the block sizes are 4x4. 

- The actual ground truth for Ramsey County is shown in Figure 16. Al- 
though this map is 5 years old, the general location of these four classes is 
accurate. In selecting the training set, updated land use maps were used. 
These were done by Joe Gibson of the University of Minnesota Department 
of Geography. 

IV. Conclusions 

It has been shown that automatic classification of broad classes simi- 
lar to Anderson Level One can be performed with an accuracy better than 
90%, Multi-temporal coverage is very helpful for improving classification 
accuracy. An improvement of 1. 5% was obtained over Pineville by adding a 
November coverage to a September coverage. Texture is also useful for 
improving classification accuracy, an improvement of 6% was noted in the 
metropolitan area study. 

Automatic classification using digital data can distinguish more classes 
and smaller areas than manual photointerpretation on imagery. For example, 
it has been shown that water bodies of less than an acre size can be detected 
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